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Abstract 

We consider the extent to which one can compute bounds on the rate 
of convergence of a sequence of ergodic averages. It is not difficult to con- 
struct an example of a computable Lebesgue-measure preserving transfor- 
mation of [0, 1] and a characteristic function / — xa such that the ergodic 
averages A„f do not converge to a computable element of L^{[0, 1]). In 
particular, there is no computable bound on the rate of convergence for 
that sequence. On the other hand, we show that, for any nonexpansive 
linear operator T on a separable Hilbert space, and any element /, it is 
possible to compute a bound on the rate of convergence of {A„f) from T, 
f, and the norm ||/*|| of the limit. In particular, if T is the Koopman oper- 
ator arising from a computable ergodic measure preserving transformation 
of a probability space X and / is any computable element of L^{X), then 
there is a computable bound on the rate of convergence of the sequence 
iA„f). 

The mean ergodic theorem is equivalent to the assertion that for every 
function K{n) and every e > 0, there is an n with the property that the 
ergodic averages Amf are stable to within e on the interval [n,K{n)]. 
Even in situations where the sequence (A„/) does not have a computable 
limit, one can give explicit bounds on such n in terms of K and ||/||/e. 
This tells us how far one has to search to find an n so that the ergodic 
averages are "locally stable" on a large interval. We use these bounds to 
obtain a similarly explicit version of the pointwise ergodic theorem, and 
show that our bounds are qualitatively different from ones that can be 
obtained using upcrossing inequalities due to Bishop and Ivanov. 

Finally, we explain how our positive results can be viewed as an ap- 
plication of a body of general proof-theoretic methods falling under the 
heading of "proof mining." 



1 Introduction 

Let r be a nonexpansive linear operator on a Hilbert space Ti., that is, a linear 
operator satisfying ||r/|| < ||/|| for all / e H. For each n > 1, let Snf = 
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/ + Tf + . . . + T^-'^f denote the sum of first n iterates of T on /, and let 
-^nf — ^Snf denote their average. The von Neumann mean ergodic theorem 
asserts that the sequence {Anf) converges in the Hilbert space norm. The most 
important instance of the mean ergodic theorem occurs when Ti is the space 
L^(A') of square-integrable functions on a probabihty space X = {X,B,fi), and 
T is the Koopman operator Tf = / o r associated to a measure preserving 
transformation, r, of that space. In that setting, the Birkhoff pointwise ergodic 
theorem asserts that the sequence (A„/) converges pointwise almost everywhere, 
and in the norm, for any / in L^{X). The transformation t is said to be 
ergodic if there are no nontrivial invariant subsets, in the sense that t~^{A) = A 
implies ^jl{A) — or = 1. In that case, lim„^oo Anf is a.e. equal to the 

constant function J f d/j., which is to say, almost every sequence of time averages 
converges to the average over the entire space. 

It is known that, in general, the sequence (Anf) can converge very slowly. 
For example, Krengel |24j has shown that for any ergodic automorphism of the 
unit interval under Lebesgue measure, and any sequence (a„) of positive reals 
converging to 0, no matter how slowly, there is a subset A of the interval such 
that, if XA denotes the characteristic function of A, then 

lim —\{AnXA){x) - pl{A)\ = oo 

almost everywhere, and 

Jhrn^ - /^(A)||p = oo 

for every p g [1, oo). For related results and references, see [151 Section 0.2] and 
[251 notes to Section 1.2] for related results and references.) Here, however, we 
will be concerned with the extent to which a bound on the rate of convergence 
can be computed from the initial data. That is, given H, T, and / in the 
statement of the von Neumann ergodic theorem, we can ask whether it is possible 
to compute, for each rational e > 0, a value r(e), such that for every n greater 
or equal to r(e), we have ||v4r(e)/ — < £• 

Determining whether such an r is computable from the initial data is not 
the same as determining its rate of growth. For example, if (a„)„gN is any 
computable sequence of rational numbers that decreases monotonically to 0, 
then a rate of convergence can be computed trivially from the sequence: given 
e, one need only run through the elements of the sequence and until one of 
them drops below e. On the other hand, it is relatively easy to construct a 
computable sequence (6„) of rational numbers that converge to 0, for which 
there is no computable bound on the rate of convergence. It is also relatively 
easy to construct a computable, monotone, bounded sequence (c„) of rationals 
that does not have a computable limit, which implies that there is no computable 
bound on the rate of convergence of this sequence, either. These examples are 
discussed in Section [5] 

Indeed, in Section[5l we show that there are a computable Lebesgue-measure 
preserving transformation of the unit interval [0, 1] and a computable charac- 
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teristic function f — XA such that the hmit of the sequence {Anf) is not a 
computable element of L'^{[0, 1]). For this we rely on standard notions of com- 
putability for Hilbert spaces, which we review there. The noncomputability of 
the limit implies, in particular, that there is no computable bound on the rate 
of convergence of (A„/). On the other hand, we show that, for any nonexpan- 
sive linear operator T on a separable Hilbert space and any element /, one can 
compute a bound on the rate of convergence of {Anf) from T, /, and the norm 
II/* II of the limit. In particular, if T is the Koopman operator arising from a 
computable ergodic measure preserving transformation of a probability space 
X and / is any computable element of L^{X), then ||/*|| is equal to | J / d/z|, 
which is computable; and hence there is a computable bound on the rate of 
convergence. 

In situations where the rate of convergence of the ergodic averages is not 
computable from T and /, is there any useful information to be had? The logical 
form of a statement of convergence provides some guidance. The assertion that 
the sequence {Anf) converges can be represented as follows: 

\le>03n\/m>n{\\A„J-Anf\\<e). (1) 

A bound on the rate of convergence is a function r(e) that returns a witness 
to the existential quantifier for each e > 0. It is the second universal quantifier 
that leads to noncomputability, since, in general, there is no finite test that can 
determine whether a particular value of n is large enough. But, classically, the 
statement of convergence is equivalent to the following: 

Ve > 0, Af : N ^ N 3n {M{n) >n^ || Am(„)/ - AJ|| < e). (2) 

To see this, note that if, for some e > 0, the existential assertion in ([T]) is false, 
then for every n there is an m > n such that || A,„/ — v4„/|| > e. In that case, e 
together with any function M {n) that returns such an m for each n represents 
a counterexample to Assertion ([1]) is therefore equivalent to the statement 
that there is no such counterexample, i.e. assertion 

But if the space is explicitly presented and ([2]) is true, then for each e > 
and Af one can compute a witness to the existential quantifier in ([2|) by simply 
trying values of n until one satisfying ||Ajvf(n) — An\\ is found. Thus, ([2|) has an 
inherent computational interpretation. In particular, given any function K{n), 
suppose we apply ([2]) to a function M{n) which, for each n, returns a value m 
in the interval [n,K{n)] maximizing ||Am/ — Anf\\- In that case, ([2]) asserts 

Ve > 3n Vm e [n, K{n)] \\A„,f - Anf\\ < e. 

In other words, if r{e) is a function producing a witness to the existential quanti- 
fier, then, rather than computing an absolute rate of convergence, r{e) provides, 
for each e > 0, a value n such that the ergodic averages A^f are stable to within 
e on the interval [n, K{n)]. 

It is now reasonable to ask for an explicit bound on r{e), expressed in terms 
of in terms of K, T, /, and e. In Section [21 we obtain bounds on r{e) that, 
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in fact, depend only on K and p = [H/H/el- Since the bound on the rate of 
convergence is clearly monotone with p, our results show that, for fixed if, the 
bounds are uniform on any bounded region of the Hilbert space and independent 
of T. As special cases, we have the following: 

. If X = then r(/,£) = 22°'^'"""°"'. 

• If X = 2'^("\ then r{f,e) — 2]j^^2y where 2^ denotes the nth iterate of 
y > 2^ starting with x. 

• li K = 0{n) and T is an isometry, then r(/,e) = 2^^p''^°^p\ 

Fixing p and a parameterized class of functions K, one similarly obtains infor- 
mation about the dependence of the bounds on the parameters defining K. 

In Section [31 we apply the results of Section [5] to the case where T is the 
Koopman operator corresponding to a measure preserving transformation on a 
probability space X = (X,B,fi). The pointwise ergodic theorem is equivalent 
to the assertion that for any / e L^{X), and for every Ai > 0, A2 > 0, and K, 
there is an n such that 

fi{{x I max \A^f{x) - A„/(a;)| > Ai}) < A2. 

n<.m<K{n) 

When / is in L'^{X), we provide explicit bounds on n in terms of Ai, A2, K, and 
||/||2- In this setting, one can obtain similar bounds using alternative methods, 
namely, from upcrossing inequalities due to Bishop and Ivanov. In SectionlH we 
show that the bounds extracted using these methods are qualitatively different 
from the ones obtained using the methods of Sections [2] and [H 

Our quantitative versions of the mean and pointwise ergodic theorems are 
examples of Kreisel's no-counterexample interpretation [22l[23]. Our extractions 
of bounds can be viewed as applications of a body of proof theoretic results that 
fall under the heading "proof mining" (see, for example, [THl [201 [H])- What 
makes it difficult to obtain explicit information from the usual proofs of the 
mean ergodic theorem is their reliance on a nonconstructive principle, namely, 
the assertion that any bounded increasing sequence of real numbers converges. 
Qualitative features of our bounds — specifically, the dependence only on ||/||, 
K, and e — are predicted by the general metamathematical results of Gerhardy 
and Kohlenbach [9] . Moreover, methods due to Kohlenbach make it possible to 
extract useful bounds from proofs that make use of nonconstructive principles 
like the one just mentioned. These connections are explained in Section [51 

In the field of constructive mathematics, one is generally interested in obtain- 
ing constructive analogues of nonconstructive mathematical theorems. Other 
constructive versions of the ergodic theorems, due to Bishop [H [51 [5], Nuber 
P7] , and Spitters [3T] , are discussed in Sections [H and [3 Connections to the 
field of reverse mathematics are also discussed in Section [5l 

The outline of this paper is as follows. In Sect ions [2] and Section[3l we provide 
our explicit versions of the mean and pointwise ergodic theorems, respectively. 
In Section [4l we compare our results to similar one obtained using upcrossing 
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inequalities. In Section [5] we provide the general computability and noncom- 
putability results alluded to above. In Section [6l we explain the connections to 
proof mining. 

Acknowledgments. We are grateful to Ulrich Kohlenbach and James Cum- 
mings for comments and corrections. 

2 A quantitative mean ergodic theorem 

Given any operator T on a Hilbert space and n > 1, let Snf — J2i<n. ^Vi ^ind 
let Anf = ^Snf- The Riesz version of the mean ergodic theorem is as follows. 

Theorem 2.1 IfT is any nonexpansive linear operator on a Hilbert space and 
f is any element, then the sequence (Anf) converges. 

We present a proof in a form that will be amenable to extracting a construc- 
tive version. 

Proof. Let M = {f \ Tf — /} be the subspace consisting of fixed-points of T, 
and let N be the subspace generated by vectors of the form u — Tu (that is, N 
is the closure of the set of linear combinations of such vectors). 

For any g of the form u — Tu we have H^nsH = < 2||u||/n, which 

converges to 0. Passing to limits (using the fact that A„ satisfies ||A„u|| < |ju|| 
for any v), we have that Ang converges to for every g £ N. 

On the other hand, clearly Anh = h for every h e M . For arbitrary /, write 
f ^ g + h, where g is the projection of / on N, and h = f — g. It suffices to 
show that h is in M. But we have 

\\Th~hf = llT/ilp - 2{Th,h) + \\h\\^ 

<\\hr-2{Th,h) + \\hf 

= 2{h,h) - 2{Th,h) ^ ' 

= 2{h-Th,h), 

and the right-hand side is equal to 0, since h is orthogonal to N . So Th = h. □ 

The last paragraph of proof shows that N-^ C M, and moreover that Anf 
converges to h. It is also possible to show that M-*- C A^, and hence M = N-^, 
which implies that h is the projection of / on M. We will not, however, make 
use of this additional information below. 

As indicated in the introduction, the mean ergodic theorem is classically 
equivalent to the following: 

Theorem 2.2 Let T and f be as above and let M : N N be any function 
satisfying M{n) > n for every n. Then for every £ > there is an n > 1 such 
that \\AM(n)f - AnfW < e. 

Our goal here is to provide a constructive proof of this theorem that provides 
explicit quantitative information. We will, in particular, provide bounds on n 
that depend only on M and ||/|l/e. 



5 



For the rest of this section, we fix a nonexpansive map T and an element / 
of the Hilbert space. A moment's reflection shows that Anf hes in the cychc 
subspace Tij spanned by {/, T/, T^/, .. .}, and so, in the proof of Theorem 12. 11 
one can replace N by the subspace Nf spanned by vectors of the form T'^f — 
T^~^^f. Let g be the projection of / onto Nf. Then g is the limit of the sequence 
(ffi)ieN, where, for each i, gi is the projection of / onto the finite dimensional 
subspace spanned by 

/- T/,T/-r2 /,..., rv-r^+V- 

The sequence {gi) can be defined explicitly by 



ifJ-Tf) 
Wf-TfW 



and . ^ 

_ if - T'-f - y /) , t rpi + lr\ 

9i+i-9i + — ||y»j-_y»+ij||2 — J)- 
For each i, we can write gt — Ui — Tui, where the sequence (wi)igN is defined by 

_ ifJ-Tf) 

and 

{f-9.,T\f-T^+^f) ^^^ 

Note that this representation of gi as an element of the form u — Tu is not 
unique, since if u and u' differ by any fixed point of T, u — Tu — u' — Tu' . 

Finally, if we define the sequence (ai)igN by — \\gi\\, then (oi) is nonde- 
creasing and converges to ||5|| . We will see in Section [5] that a bound on the rate 
of convergence of (a^) might not be computable from T and /. Our strategy 
here will be to show that, given a fixed "counterexample" function M as in 
the statement of Theorem 12.21 the fact that the sequence (a^) is bounded and 
increasing allows us to bound the number of times that M can foil our attempts 
to provide a witness to the conclusion of the theorem. 

First, let us record some easy but useful facts: 

Lemma 2.3 1. For every n and f, ||^n/|| < ||/||. 

2. For every n and u, An{u — Tu) = (u — T^u)/n, and \\An{u — Tu)\\ < 
2\\u\\/n. 

3. For every /, g, and e > 0, */ |1/ — gW < e, then \\Anf — Ang\\ < e for any 
n. 

I For every f, if (/, f-Tf)< e, then \\T f - f\\ < V2^. 
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Proof. The first two are straightforward calculations, the third follows from the 
first by the linearity of An, and the fourth follows from calculation ([3]) in the 
proof of Theorem 12. 11 with / in place of h. □ 

Lemma 2.4 For every f , if \\Tf — f\\ < e, then for every m > n > I we have 
\\Amf — AnfW < (to — n)s/2. In particular, if \\Tf — /|| < s and m > 1, then 
\\A„f-f\\<me/2. 

Proof. Suppose m > n > 1. Then 

m—1 n— 1 



Amf - AnfW = ||1 ^ TV - - E TVIi 
TO ^ — ' n ^ — ' 

m — 1 n— 1 

= — lln^TV-TO^TVII 
mn ^ — ' ^ — ' 

i=0 j=o 

m — 1 n— 1 

= — ||n^r7-(TO-n)^TV|| 



mn 

i—n j— 



There are now n-(m—n) instances of T'/ in the first term and n-{m—n) instances 
of f in the second term. Pairing them off and using that ||T*/ — T^/|| < 
(i — j) ■ £ for each such pair, we have 



m(m — 1) n(n — 1)\ / n(n ~ I) 

' — (to — ?i) ' 



2 2 

'to(to— 1)\ /n(n — 1) 



= (to — n)e/2, 

as required. □ 

We now turn to the proof of the constructive mean ergodic theorem proper. 
The first lemma relates changes in gi to changes in Oi. 

Lemma 2.5 Suppose \aj — ai\ < e^/(2||/||). Then \\gj — gi\\ < e. 

Proof. Assume, without loss of generality, j > i. Since gj is the projection of / 
onto a bigger subspace, gj — g.^ is orthogonal to gi. Thus, by the Pythagorean 
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theorem, we have 




= \aj — ai \ ■ \aj + ai\ 




as required. 



□ 



The next lemma introduces a strategy that we will exploit a number of times. 
Namely, we define an increasing function F such that if, for some j, \\gF(j) ~5jll 
is sufficiently small, we have a desired conclusion; and then argue that because 
the sequence (oi) is nondecreasing and bounded, sufficiently many iterations of 
F will necessarily produce such a j. (In the next lemma, we use F{j) = j + 1.) 

Lemma 2.6 Let e > 0, let d ~ d{e) = [32||/||^/£^] . Then for every i there is 
a j in the interval [i, i + d) such that \\T[f — gj) — (/ — gj)\\ < s. 



Proof. By Lemma [273] 4. to obtain the conclusion, it suffices to ensure {f—gj, f— 
gj - T{f - gj)) < We have 

(/ - .9j, / - .9j - T{f ~ gj)) = (/ - gjj - Tf) + (/ - gj^Tg^ - g^) 

= if - dj^Tgj - gj) 



because gj is the projection of / on a space that includes / — T/, and / — gj 
is orthogonal to that space. Recall that gj is a linear combination of vectors of 
the form T'^f — T^+^ f for k < j, and gj+i is the projection of / onto a space 
that includes Tgj — gj. Thus, continuing the calculation, we have 



• = (./- gj+i,Tgj - gj) + {gj+i ~ gj,Tgj - g^) 

= {gj+i -gj^gj-Tgj) 

< ll.9j+i -ffill • \\Tgj- gjW 

< ||.9,+i-g,||(|!rg,|| + ||g,||) 
<2||.9,+i-g,||. 11/11 



Thus, if \\gj — gj+i\\ < gljjyj we have the desired conclusion. 

Consider the sequence Oi, ai+i, ai+2, . . . , ai+d_i. Since the a^'s are increasing 

I f II ^ 

and bounded by ||/||, for some j G [i,i + d) we have \aj+i ~ aj\ < < ■g^fjp'- 



Lemma 2.7 Lete > 0, letn > 1, and let d' = d'{n,e) = d{2e/n) = \2n'^\\f\\^/e^'\. 
Then for any i, there is an j in the interval [i, i + d') satisfying \\An{f — gj) — 
(/-.9j)II <£• 



By Lemma [2.51 this implies \\gj — gj+i\\ < jfwi a-s required. 



□ 
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Proof. By the previous lemma, there is some j in the interval [i,i + d') such that 
\\Tif-9j)-if-9j)\\ < 2e/n. By Lemma[2l]this implies |lA„(/-9j)-(/-g,)|| < 

£. □ 



Lemma 2.8 Lets > 0, letm> \, letd" = d"{m,e) = d'{m,e/2) = \32m'^\\f\\'^/e*'\. 
Further suppose \\gi — gi+d"\\ < e/4. Then for any n < m, \\A„{f — gt) — {f ~ 
g^)\\<e■ 

Proof. By the previous lemma, for any n < m, there is some j in the interval 
[i, i + d") such that || A„(/ - gj) - (/ - gj)\\ < e/2. This implies 

WMf ~ 9^) ~ if ~ 9^)\\ < WMf - 9^) ~ Mf ^ 9,)\\ 

+ \\AM - g,) - (/ - g,)\\ + II (/ - g,) - (f - .g,)ll 

= l|A„(.g, - 5^)11 + WMf - 9j) -if- 9j)\\ + U~ 9j\\ 

< \\AM'9J)~{f-9J)\\+2\\9,-g^\\ 

<£, 

since ||5j - g,\\ < \\g^+d" - .9i|| < e/4. □ 



Lemma 2.9 Let e > 0, let m > 1, let d'" = d"'{m,e) = d"{m,e/2) = 
[2^TO^||/||'*/£'*] . Further suppose \\gi — gi+d"'\\ < e/8. Then for any n < m, 
\\A.^{f-g.i)^AM-9:)\\<e- 

Proof. Apply the previous lemma with e/2 in place of e. Then for every n < m, 

WArnif - g^) - A„(/ - 5.)|| < \\Ar,Af ~ .9.) - (/ - .9.)ll + 

\\AnU - g^) - if - gr)\\ 

< e/2 + e/2 ^e, 

as required. □ 

We will not need the following lemma until Section [3l but it is a quick 
consequence of the preceding result. 

27]] j-||2 

Lemma 2.10 Lete > 0, m > 1, d'" = d"'{m,e) as in 

LemmaSSM e = \^^^^, 

and d — d{m,e) = d'" ■ e. Then for any i, there is a j in the interval [i,i + d) 
such that for every n < m, ||y4„(/ — gj) — Am{f — gj)\\ < e. 

Proof. Consider the sequence a^, Oi+d'" , a-i+d"'-2, ■ ■ ■ , o-i+d'" -{e-i)- Since the se- 
quence (oi) is increasing and bounded by || / || , for some k < e and j = i + d'" ■ k, 

we have ||aj — aj-|-d"'|| < £^/(2''||/||). Bv Lemma l2.5[ this implies H^j — gj+d'" 1 1 < 
e/8. Applying the previous lemma with j in place of i, we have the desired con- 
clusion. □ 
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Let us consider where we stand. Given e > and a function M satisfying 
M{n) > n for every n, our goal is to find an n such that ||^ji/(n)/ ^ ^nfW ^ £• 
Now, for any n and i, we have 

< PM(n)(/-5,:) - + \\AM{n)9i\\ + IIAiff,:!!- 



Lemma 12.91 tehs us how to ensure that the first term on the right-hand side 
is small: we need only find an i such that H^i+d'" ^ is small, for some d'", 
depending on M{n), that is sufficiently large. On the other hand, by Lemma [2?3l 
andM(n) > n, we have \\Ang^\\ < \\u^\\/{2n) and \\AM(n)g^\\ < h»ll/(2Af (n)) < 
||ui||/(2n). Thus, to guarantee that the remaining two terms are small, it suffices 
to ensure that n is sufficiently large, in terms of Ui. 

There is some circularity here: our choice of i depends on M{n), and hence 
n, whereas our choice of n depends on Ui, and hence i. The solution is to define 
sequences {ik)ki£N and {nk)k<£N recursively, as follows. Set io — 1, and, assuming 
ik has been defined, set 



Uk — max( 

and 

=ik + d"'ie/2, M{nk)) = ik + 



,1) (4) 

2"MK)4|1/|14] 



Let e = , and consider the sequence o^q, a^^ , . . . , ai^_^ . Once again, 

since this is increasing and bounded by ||/||, for some fc < e we have \ai^^^ — 
Oifcl < £^/2^||/||. Lemma [^31 implies 115^^^! — QikW — Write i = ik and 

n = Uk, so that ik+i = i + d"'{M (n), e/2). Applying Lemma [2.91 we have 

\\AMin){f - 9^) - Anif - gi)\\ < e/2. 

On the other hand, from the definition of n = n^, we have 

\\An9^\\ < ||u^||/(2n)<£/4 

and 

\\AM(n)g^\\ < H\\/{2n) < e/4, 

so \\AM{n)f — AnfW < e, as required. Notice that the argument also goes 
through for any sequences (ik) and (n^) that grow faster than the ones we have 
defined, that is, satisfy ([4]) and (O with "=" replaced by ">." In sum, we have 
proved the following: 

Lemma 2.11 Given T, f , e, and M, sequences {ik) and [uk) as above, and the 
value e as above, there is ann satisfying 1 < n < n^-i and ||^_A/(n)/^^?i/|| < £• 

This is almost the explicit version of the ergodic theorem that we have 
promised. The problem is that the bound, ie, is expressed in terms of sequence 
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of values as well as the parameters M, f, and e. The fact that the term 
||T*/ — T'+^/ll appears in the denominator of a fraction in the definition of the 
sequence (ui) makes it impossible to obtain an upper bound in terms of the other 
parameters. But we can show that if, for any i, ||T*/ — T'+^/|| is sufficiently 
small (so T*/ is almost a fixed point of T), we can find alternative bounds on 
an n satisfying the conclusion of our constructive mean ergodic theorem. Thus 
we can obtain the desired bounds on n by reasoning by cases: [fT'f-T'+^f is 
sufficiently small for some i, we are done; otherwise, we can bound ||wi||. 

The analysis is somewhat simpler in the case where T is an isometry, since 
then ||TV - T'+\/|| = 11/ - T/ll for every i. Let us deal with that case first. 

Lemma 2.12 If T is an isometry, then for any m > 1 and e > 0, one of the 
following holds: 

1- II An/ -./II <e, or 

/n II II ^ (i+l)m\\ fW^p 

2. \\Ui\\ < '^^ " for every i. 

Proof. By the Cauchy-Schwartz inequality we have ||uo|| < ||/||^/||/ — Tf\\. By 
LemmaEH if ||/ - T/|| < 2e/m then \\Amf - /|| < e. 

Otherwise, 2£/m <\\f- T f\\ = ||TV - T^+Vll for every i. In that case, we 

I f II ^ 

have ||wo|| ^ 2e ' since ||/ — ,gi|| < ||/||, we obtain 

mil f II II f — Qill ,, ,, mW f IP 

Ih^+ill < \\u,\\ + "-^'"^^^ < \\u,\\ + 

for every i. The result follows by induction on i. □ 

We can now obtain the desired bounds. If n = 1 does not satisfy ||^M(n)/~ 

^n/ll < Ej we have < ] for each k. Otherwise, let K be any 

nondecreasing function satisfying K{n) > M{n) > n for every n. From the 
definition of the sequence (i^), we can extract a function K{i) such that for 
every k, K''{1) > ij^: 

• p=ni/ii/£i 

• K{i) = I + 2i3p4^((i + l)if (l)p2) 

• e = 

As long as / is nonzero, we have p > 1, which ensures that i^'^(l) > 7ie_i and 
^'=(1) > 1. Thus, we have ||Am(„)/ - ^„/|| < e for some n < K^l). 

On the other hand, given a nondecreasing function K to serve as a bound 
for Af , the best a counterexample function M{n) can do is return any m in the 
interval [n,K{n)] satisfying \\A„if — A„f\\ > e, if there is one. Thus, we have 
the following: 
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Theorem 2.13 Let T be an isometry on a Hilhert space, and let f be any 

nonzero element of that space. Let K be any nondecreasing Junction satisfying 
K{n) > n for every n, and let K be as defined above. Then for every e > 0, 
there is an n satisfying 1 < n < K'^(l), such that for every m in [n,K(n)], 

This is our explicit, constructive version of the mean ergodic theorem, for 
the case where T is an isometry. If T is merely nonexpansive instead of an 
isometry, the argument is more complicated and requires a more general version 
of Lemma 12.41 



Lemma 2.14 Assume T is a nonexpansive mapping on a Hilbert space, f is 
any element, m > n > 1, and £ > 0. Then for any k, if n > 2fc||/||/£ > k, then 
either \\T^ f - T'^+Vll > e/{2m) or \\A^f - A^f\\ < e. 



Proof. We have 

m— 1 

ran 



A^f - AnfW = — ||n ^ T7 - ™ ^ TVII 
ran ^ — ' ^ — ' 

i=0 j=o 

^ k-l fc-1 

<—\\nJ2Tf-mJ2T^f\\ 
ran ^ — ' ^ — ' 

1=0 j=o 

-. m — 1 n— 1 

i—k j—k 
^ k-l 

< _||(„_™)^r-'/|| 

ran ^ — ' 

-. ra~k~\ n—k—1 

— \\n ^ T\T'^f)-m ^ T-'-(TV)||. 



mn 

1=0 j=o 



The first term is less than or equal to 



(^-")||yT.;||<m<e/2. 
mn ^-^ n 

i=0 

Using an argument similar to the one used in the proof of Lemma l2.41 we have 

rn—k—l n—k—1 

— \\n J2 T\T^f)^ra ^ T^(r'^7)ll < 
nra ^ — ' ^ — ' 

i=0 j=o 

(m - n)||TV - T^+Vll < ™||TV - T'^+VII 

If \\T^f ~ T'^'+Vll < 2^, the second term in the last expression is also less than 
or equal to e/2, in which case — Anf\\ < e. □ 
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We now have an analogue to Lemma 12.121 for the nonexpansive case. 
Lemma 2.15 For any i > 0, n > 1, and e > 0, either 

1. there is an n < 2i[-!i|^] such that ||^M(n)/ ~ ^n/ll ^ or 

Proof. Use induction on i. At stage i + 1, if clause 1 doesn't hold, we have 
~ ^i+i/ll > in which case we can use the inductive hypothesis, 
the previous lemma, and the definition of Ui+i to obtain clause 2. □ 

The definition of the sequences {ik) and {n^) remain valid. What has changed 
is that we now have a more complex expression for the bounds on in the case 
where case 2 of Lemma [2.151 holds for each ik- In other words, we have that for 
every k, 



nk < 



-2 

1=0 



^Mi2l\\\f\\/e}: 



unless there is an n < 2ifc [||/||/£] such that \\AM(n)f ~ < £• Assuming 

if is a nondecreasing function satisfying K{n) > M{n), we can replace this last 

bound by ^^l^{ik + l)K{2ik\\\f\\/e])\. Define 
• P=\\\f\\/e] 



• K{i) = i + 2^'^p^K{{i + l)K{2ip)p^) 

• e — ""^ 



-2V 
Then we have: 

Theorem 2.16 Let T be an nonexpansive linear operator on a Hilbert space, 
and let f be any nonzero element of that space. Let K be any nondecreasing 
function satisfying K{n) > n for every n, and let K be as defined above. Then 
for every e > 0, there is an n satisfying I < n < K (1), such that for every m 
in [n,K{n)], \\A„J - A^fW < e. 

Direct calculation yields the following asymptotic bounds. 

Theorem 2.17 Let T be any nonexpansive map on a Hilbert space, let K be any 
nondecreasing function satisfying K{n) > n for every n, and for every nonzero 
f and e > 0, let rK{f,£) be the least n > I such that \\Amf — Anf\\ < £ for 
every m in [n,K(n)]. 

• IfK = then rKif,e) = 2^"''' 

• IfK = 20(")^ then rK{f,£) = 2\,^^,y 

• If K = 0{n) and T is an isometry, then rxifjS) — 2^^p i°gp). 
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In these expressions, p abbreviates [|l/|l/e] and 2^ denotes the nth iterate of 
y i—> 2^ starting with x. 

Alternatively, we can fix p and consider the dependence on K . Here are two 
special cases. 

Theorem 2.18 Let T be an isometry on a Hilbert space, and let K be as above. 
Ftxp=\\\f\\/e]. 

• If K{x) = X + c, then, as a function of c, rK{f,£) — 0{c). 

• If K{x) — cx + d, then, as a function of c, rx{f,£) = c*^'^^. 

3 A quantitative pointwise ergodic theorem 

Let r be a measure preserving transformation on a probability space {X,B,fx), 
and let T be the Koopman operator on the space L^{X), defined by Tf — for. 
The mean ergodic theorem implies that for any / in L^(A'), the ergodic averages 
converge in the norm. But since, for any / in L^(A'), ||/||i < |l/|i2, this 
implies convergence in the norm also. We also have that L^{X) is dense in 
L^(A'), and if ||/ - /'||i < e then ||A„/ - A„/'|ji < e for every n. As a result, 
we have convergence for every / in L^[X) as well. 

Birkhoff's pointwise ergodic theorem makes a stronger assertion: 

Theorem 3.1 Let T be the Koopman operator corresponding to a measure pre- 
serving transformation t on a probability space X , and let f be any element of 
L^{X). Then {A^f) converges pointwise, almost everywhere. 

This is equivalent to the following: 

Theorem 3.2 Given T and f as above, for every Ai > and A2 > there is 
an n such that for every k > n, 

fi{{x I max \Arnf{x) - Anf{x)\ > Ai}) < A2. 

n<m<A: 

By the logical manipulations described in the introduction, this, in turn, is 
equivalent to the following: 

Theorem 3.3 Given T and f as above, for every Ai > 0, A2 > 0, and K there 
is an n > 1 satisfying 

fi{{x I max \Anf{x) - A„J{x)\ > AJ) < A2. 

n<m<K[n) 

Using the maximal ergodic theorem, described below, one can reduce all three 
versions of the pointwise ergodic theorem to the case where / £ L^{X). In this 
section, we will provide a constructive proof of Theorem 13.31 for this restricted 
case, with an explicit bound on n, expressed in terms of K, ||/||2, Ai, and A2. 
The maximal ergodic theorem can be stated as follows: 
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Theorem 3.4 Suppose n > 1, and let A = {x \ maxj<„ ^^^^ r-'/(a;) > 0}. 
Then J^fd^ > 0. 

The proof of this is essentiaUy constructive (see [31], and the proofs in [3l [32]). 
We wiU make use of the foUowing coroUary: 

Corollary 3.5 For any A > and n > 1, 



fi{{x\ max |^,/(x)|>A})<||/||i/A. 

l<i<n 



Proof. We have 



{x\ max \Aif{x)\> X}C{x\ max yli(|/|)(a;) > A} 

l<i<n l<i<n 

^{x\ max ^,(|/|-A)(x) >0} 

l<i<n 

= {x\ max ^T^(|/|-A)(x) >0}. 

j<i 

Call this last set A. By Theorem 13.41 we have /^(|/| — A) > 0, and hence 
||/||i= / \m> I I/Mm > AAi(A), 



so Ai(A) < 11/11 i/A. □ 
Taking limits, CoroUarv 13.51 implies that for every A > 0, 



ix{{x\ sup|A,/(x)| >A})< ll/lli/A. 
i>i 

We will stick with the formulation above, however, to emphasize the combina- 
torial character of our arguments. 

Most contemporary presentations of the pointwise ergodic theorem proceed 
to define f*{x) ~ limsup^„/(a;) and f*{x) = liminf yl„/(x), and then use the 
maximal ergodic theorem to show that the two are equal almost everywhere. 
Billingsley [3], however, presents a proof that makes use of the limit of 
Anf{x), as guaranteed to exist by the mean ergodic theorem, rather than /* 
and . We will "mine" this proof for our constructive version. 

The idea is as follows. For the moment, let h denote the limit of (A„/), 
and for each i let fi = h + gi, where the sequence (gi) is as defined in the last 
section. Then / = lim^ fi. For any m > 1, n > 1, i > 0, and x ^ X, we have 

\A„J{x) - A^f{x)\ <\A,M - f^){x)\ + \A„J,{x) - A,J,{x)\ + 

\AM~m^)\ 

< \A^{f - fiKx)\ + \A,nh{x) - Anh{x)\+ ^ ' 
\A^g^{x)\ + \A^g^{x)\ + \AM - /O Wl- 

Using the maximal ergodic theorem, the first and last terms can be made small 
outside a small set of exceptions, for all values of m and n simultaneously, by 
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taking i big enough so that ||/ — fi\\2 is small. The second term is equal to 0, 
since /i is a fixed point of T. Finally, using the fact that gi is of the form u — Tu, 
the third and fourth terms can be made arbitrarily small in the norm, and 
hence small outside a small set of exceptions, by taking m and n sufficiently 
large. 

The problem is that in our constructive version we do not have access to h, 
which is equal to / — limi gf, nor can we determine how large i has to be to 
make ||/ — fi\\2 sufhciently small. Instead, we replace h by an approximation 
/ — gj . Then we get 

\A^f{x) - Anf{x)\ < \A^{gj - g^){x)\ + 

\An{f - gj + gi){x) - A^if - gj + g^){x)\^ 
\An{9] -9i){x)\ 
< \Am{gj - g^){x)\ + |An(/ " gj){x) ~ Anif - gj){x)\+ 
\A,ngi{x)\ + \Angi{x)\ + \An{gj - gi){x)\. 

Similar considerations now hold: we can make the first and last terms small 
outside a small set of exceptions, independent of m and n, by ensuring that 
\\gj — gi\\2 is sufficiently small. We can make the second term small using 
Lemma 12.101 with an appropriate choice of j. Finally, the third and fourth 
terms are small outside a small set of exceptions when m and n are sufficiently 
large. 

Thus our task is to find a value of n such that 

^max i\Am{gj - g^){x)\ + |A„(/ - gj){x) - A„(/ - gj)ix)\ + 

n<7n<K (n) 

\Arng^ix)\ + \A„g,ix)\ + \A„igi - (7) 

is less than or equal to Ai, outside a set of size at most A2. We will consider the 
various components of this sum, in turn. 

We will make use of Chebyshev's inequality, which shows that |/(a;)| is small, 
outside a small set of exceptions, when |j/||2 is small: 

Lemma 3.6 For any A > 0, n{{x \ \f{x)\ > A}) < ||/||i/A^ 

Proof. Otherwise, we would have ||/||2 = / \f\^dn > A2(||/||2/a2) = |1/|12. □ 

The next lemma deals with the first and last terms in ([7]). 

Lemma 3.7 Suppose \\gj — gi\\2 < A1A2/8. Then for any k and n satisfying 
1 < n < fc, we have 

fi{{x I max - g^){x)\ + \An{gj - g;){x)\) > Ai/2}) < A2/2. 

n<m<k 

Proof. By Corollary l3.5[ we have 

fxiix I max |A,„(5, - g,)ix)\ > Ai/4}) < (Ai A2/8)/(Ai/4) = A2/2. 
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Since \Am{gj ~ gi){x)\ + \An{gj — gi)ix)\ > Xi/2 implies that either \Amigj — 
gi){x)\ > Ai/4 or \An{gj — gi){x)\ > Ai/4, the set in the last displayed formula 
includes the set in the statement of the lemma. □ 

The next four lemmas concern the third and fourth terms of (O, which are 
of the form Ang ~ An {u — Tu) = (u — T"'u) jn. To show that we can make these 
terms small outside a small set by making n sufficiently large, we will need to 
split u into two components, one of which is bounded, and the other of which 
is small in the norm. The following lemma enables us to do this. 

Lemma 3.8 For any u G L^(A') and L > 0, write u ^ u' + u" , where 

u'lx) = I ^^^^ ^f\u{x)\ < L 
1 otherwise 

and u" = u — u' . Then ||u'||oo < L, and ||w"||i < ||u||2/i- 
Proof. The first claim is immediate. For the second, we have 

\u{x)\dfi< u'^{x)/L d^i < \\u\\l/L, 

I |ii(a;)|>L} J{x \ \u{x)\>L} 

as required. □ 




Lemma 3.9 Let u G L^{X), let g = u — Tu, and suppose n > and 
n > 1. Then for any k > n, 

fi{{x I max (|A,„g(x)| + |A„g(a;)|) > Ai/4}) < A2/4. 

n<m<k 

Proof Let L = 2'^||u||^/AiA2, and let u — u' + u" be the decomposition in 
Lemma [3.81 Then 

\A„,g,{x)\ + \Ang^{x)\ < \A„,{u' - Tu'){x)\ + |A„(m' - Tu'){x)\ + 

- Tu"){x)\ + \An{u" - Tu"){x)\. (8) 

Then we have \\u" - Tu"\\i < 2\\u"\\i < 2\\u\\yL, and so, by CoroUaryEH 
I max \Aniu" - Tu")ix)\ > Ai/16}) < ^-^^ = A2/4. 

l<ri<fc LAi 

So for I < n < m < k, the sum of the last two terms on the right-hand side 
of ([8]) is at most Ai/8 outside a set of measure A2/4. On the other hand, for 
every x and m > n, \A„i{u' — Tu'){x)\ and |A„(u' — Tu'){x)\ are bounded by 
2||M'||oo/n < 2L/n < Ai/16, since n > 32L/Ai. □ 

Taking k — K{1) in the following lemma shows that if n = 1 does not provide 
a witness to the conclusion of our constructive pointwise ergodic theorem, we 
can bound the terms |lui||2. This is analogous to Lemma [2. 121 
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Lemma 3.10 For any k>l, one of the following holds: 
1. ii{{x I maxi<,„<fe \{Araf - /)(a;)| > Ai}) < A2, or 

aw II / (i+'^)\\f\\lk^'^ t 

II "1*1 II 2 S — xjx •'"'^ every i. 

Proof. Suppose, first, ||r/-/||2 < 2Al^A^/fc3/2. Then by LcmmaEH for every 
m satisfying 1 < m < /c we have ||^m/ ~ /II2 < \i\/\2/ Vk. By Lemma [3.61 
this implies 

^l{{x I \{A„J - f){x)\ > Ai}) < {XlX2/k)/Xf = X2/k 

for each m in the interval [1, k]. So, in that case, clause 1 holds. 

Otherwise, we have ||r/-/||2 > 2AiVA^/fc3/2. In that case, clause 2 follows 
from the definition of the sequence (u^), as in the proof of Lemma [2. 121 □ 

Combining the last two lemmas, we have the following. 

Lemma 3.11 Either ^j.{{x \ rRaxi<,n<K{i) \{Anf - f){x)\ > Ai}) < A2, or, for 

every i, ij n > .4.2 , then for any k > n we have 

fi{{x I max {\Amg^{x)\ + \Angi{x)\) > Ai/4}) < A2/4. 

n<m<k 

Finally, we address the second term of fT]), using Lemma [2. 101 

Lemma 3.12 Given 1 < n < k, let e = e(fc,Ai,A2) = 234F[||/||2/(AiVA^)]^ 
Then for any i, there is a j in the interval [i, i + e) satisfying 

fi{{x I max \A,n{f ~ gj){x) - A„(/ - 5,)(x)| > Ai/4}) < A2/4. 

n<m<fc 

Proof. Working backwards, it is enough to ensure that for every m satisfying 
n < m < fc, 

^Ji{{x I |A™(/ - g,){x) - Anif - g,){x)\ > Ai/4}) < ^. 



By Lemma 13.61 it suffices to ensure that 

1&\\A,M - 9,) - Mf - 9j)\\l . A2 
A? - 4fc' 

for each such to, i.e. 

Ai\/A2 



\\A^{f - gj) - An{f - gj)\\2 < 



8k 



Substituting the right hand side for e in Lemma 12.101 this is guaranteed to 
happen for some j in [i,i + e). □ 
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We can finally put all the pieces together. Set io ~ 0. Assuming has been 
defined, let 

^ol2I.-'/■1^3„•2ll -fi^^ 



nk = 
and 



4\2 



At A 



ik+i ^ik + e{K{nk), Ai, A2) = ik + 2''^K{nky\\\fh/{XiV^)]''- 
Finally, define 

e 

Then we have: 



Ai V A2 



Lemma 3.13 Let Xi > 0, let A2 > 0, and let K he any Junction. Given f, 
define the sequence {ik) and the value e as above. Then there is an n satisfying 
1 < n < He and 

fi{{x I max \Amf{x) - Anf{x)\ > Ai}) < A2. 

n<m<K (n) 

Proof. Suppose n — I does not witness the conclusion. As in the proof of 
Lemma [2.111 there is some k < e such that for \ai^^-^ ~ o.ik\ — wfjf^- 
n — Uk and i — ik. By Lemma I3.12| there is a j in the interval [i^ik+i) 
satisfying 

^l{{x I max |A„,(/ - gj){x) - A^if - gj){x)\ > Ai/4}) < A2/4. 

n<m<K [n) 

Since i < j < ik+i we have \aj — aj| < ja^^^j — aj\ and so \\gi — gj\\ < A1A2/8. 
By Lemma 13.71 we have 

H{{x I max {\Ara{gj - g^){x)\ + \A^{gj - g,){x)\) > Ai/2}) < A2/2. 

n<'m<K{n) 

By Lemma 13.111 we have 

tl{{x\ max {\{Arag^){x)\+\{Ang^){x)\)> Xi/A})<X2/A. 

n<.m,<.K (n) 

The result follows. □ 
Once again, we can do some final housecleaning. Define: 

• p= ni/ii/(AiVA^)i 

• K{i) = I + 23V^i^(2l2i^(l)3i2p4) 



• e 



r27||/||2/(AiVA^)l 



Theorem 3.14 Let t he any measure preserving transformation of a finite mea- 
sure space X , and let f be any nonzero element of L^{X). Let Ai > 0, let A2 > 0, 
and let K be any function. Then, with the definitions above, there is an n sat- 
isfying 1 < n < K (1) and 

li{{x I max \Amf{x) - Anf{x)\ > Ai}) < A2. 

n<7n<K(n) 
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4 Results from upcrossing inequalities 



We are not the first to develop constructive versions of the ergodic theorems. 
Let r be a measure preserving transformation on a finite measure space X = 
(X,B,fj,), and for every real a < (3, let uja^/six) denote the number of times 
the sequence (A„/(a;))„gN upcrosses the interval [a,/3], that is, proceeds from 
a value Aif{x) less than a to a value Ajf{x) greater than /3. For any x G X, 
the statement that {Anf{x))ni£n converges is clearly equivalent to the statement 
that, for every a < P, uJa^pix) is finite. Bishop [4l[5l[6] showed that for any / 
in L^{X), we have 



In particular, let fljp denote the set {x \ uja.p{x) > k}, that is, the set of points 
for which the sequence makes no less than k upcrossings. Bishop's inequality 
immediately implies 



from which the ordinary pointwise ergodic theorem follows in a straightforward 
way. Let the set ft^^ be defined, analogously, to be the set of elements x for 
which the sequence (A„/(a;))„gN makes no less than k downcrossings from a 
value above /3 to a value below a. Recently, Ivanov [1^ has shown that for 
nonnegative functions /, the size of this set decays exponentially with k: 



A similar result was obtained, independently, by Kalikow and Weiss [16]. Bishop's 
and Ivanov's results and their consequences are explored thoroughly in [TS]. 
There has recently been a surge of interest in such upcrossing inequalities; see, 
for example, 

Upcrossing inequalities can be used in a crude way to obtain bounds on 
our constructive pointwise ergodic theorem. Theorem 13.31 They can also be 
used, indirectly, to obtain bounds on our constructive mean ergodic theorem. 
Theorem 12.21 in the specific case where the operator in question is the Koop- 
man operator corresponding to a measure preserving transformation. Of course, 
the upcrossing inequalities characterize the overall oscillatory behavior of a se- 
quence, and thus provide a more information. On the other hand, our results 
in Section [2 apply to any nonexpansive mapping on a Hilbert space, and so are 
more general. There are further differences: because we obtain our pointwise 
results from our constructive version of the mean ergodic theorem, the norm 
II/II2 of / plays a central role. In contrast, results obtained using upcrossing 
techniques are more naturally expressed in terms of ||/||i and ||/||oo- In this 
section, we will see that when the two methods yield analogous results, they 
provide qualitatively different bounds. 
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First, let us show how Bishop's inequality leads to a bound on the witness to 
our constructive pointwise ergodic theorem, Theorem l3.3[ when / is in L'^{X). 
Note that for any (3 > a > 0, Bishop's result imphes that the number of 
upcrossings of the interval [a, f3] satisfies 

/ UJa,i3 dfi < ||/||oo/(/3 - a). 
Jx 

By symmetry, the same bound holds for downcrossings. Given Ai, divide the 
essential range [— 1|/||, ||/||] of / into [4||/||oo/Ai] intervals of size Ai/2 each. 
Given K as in Theorem 13.31 consider the sequence of e + 1 intervals 

[l,/f(l)], [Kil),KiKm, [A-(l),if'^+i(l)]. 

For every i e [0, e] and j £ [1, [4||/||oo/Ai]], let Aij denote the set of x such 
that {Anf{x)) upcrosses or downcrosses the jth interval in the essential range 
of / somewhere in the interval [K''{1), K^^^{1)]. Now, suppose that for each 

i < e, 

I , max \A„J{x) ~ > Ai}) > A2. (9) 

Since each x in this set either upcrosses or downcrosses one of the [4||/||oc/Ai] 
intervals in the range of /, we have, for each i < e, 

j=i...r4||/iu/Ai] 

So 

5Z MAj) > (e + l)A2, 

j=0...ej = l...r4||/||„„/Ai] 

which means that for some j, J2i=o e/^(^*j') > (e + l)AiA2/4||/||oo- Let [a,/9] 
be the corresponding interval, and let u!'{x) be the number of times {Anf{x)) 
upcrosses or downcrosses this interval. Then we have 

J2 ^^{A^^,)= f E XA^.,dfi< fcj\x)d^i<2\\f\\^/{f3-a)=4\\f\\^/Xi. 

i=0...e '' j=0...e 

In other words, we have shown that if ^ holds for each i < e, we have 

(e + l)AiA2/4||/||oo <4||/||oo/Ai, 

which implies e + 1 < 16||/||^/A2A2. Taking the contrapositive of this claim, 
we have the following analogue of Theorem 13.141 

Theorem 4.1 Let T he a Koopman operator corresponding to a measure pre- 
serving transformation of a space X and let f be any element of L°°[X). Let 
Ai > 0, let A2 > 0, and let K he any function satisfying K{n) > n for every n. 
Let e — [16||/||^/A2A2] . Then there is an n satisfying 1 < n < K'^{1) and 

li{{x I max \A,nf{x) - A„/(x)| > Ai}) < A2. 

n<m</C(n) 
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In other words, we can bound a witness to our constructive pointwise ergodic 
theorem by e = [16||/||^/Af A2] iterations of K on 1. Compare this to our 
Theorem l3.141 which requires asymptotically fewer (r2'^||/||2/Ai-\/A2l) iterations 
of a function, K, which, however, grows faster than K. 

Ivanov's inequality does not seem to enable us to improve the bound in 
the previous theorem. But a consequence of Ivanov's inequality, obtained by 
Kachurovskii, enables us to treat the mean ergodic theorem in a similar way. 
A sequence of real numbers a„ is said to admit k e- fluctuations if there is a 
sequence 

mi < ni < TO2 < ^2 < ■ ■ ■ < rrik < Uk 

such that for every i, I < i < k, lami—am \ > £• Let T be the Koopman operator 
arising from a measure preserving transformation on X . By the mean ergodic 
theorem, for every e > 0, the number k^ of e- fluctuations is finite. Kachurovskii 
[15l Theorem 29] shows: 

Theorem 4.2 Let f be any element of L°°{X). Then for every e > 0, 
..<c(M^y(l + ln(M^)) 

for some constant C . 

Now, given any counterexample function AI satisfying M{n) > n for every n, 
consider the sequence 

^l/i ^M(l)/i ^M(M(1))/: ■ ■ ■ 1 ^M'=e + i(l)-/'- 

At least one step must change by less than e. Thus, we have the following 
analogue to our Theorem 12. 161 

Theorem 4.3 Let T be a Koopman operator corresponding to a measure pre- 
serving transformation of a space X and let f be any element of L°°{X). Let 
K be any function satisfying K{n) > n for every n. Let k{f,s) be the bound 
on k^ given in the preceding theorem. Then for every e > 0, there is an n, 
I < n < K''^-f''^\l), satisfying \\Amf — Anf\\ < e for every m G [n,K{n)]. 

In other words, we can bound a witness to the conclusion of the constructive 
mean ergodic theorem with k{f,e) iterates of K. In contrast. Theorem 12.161 
required e(/, e) = C[||/||2/e] iterates of a faster-growing function K. 

5 Computability of rates of convergence 

Suppose (an)„gN is any sequence of rational numbers that decreases monoton- 
ically to 0. No matter how slowly the sequence converges, if one is allowed to 
query the values of the sequence, one can compute a function r{e) with the 
property that for every rational e > and every m > r{e), |a„i — ar{e)\ < 
The algorithm is simple: on input e, just search for an m such that am < s. 
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On the other hand, it is not hard to construct a computable sequence (a„)„gN 
of rational numbers that converges to 0, with the property that no computable 
function r{e) meets the specification above. This is an easy consequence of 
the unsolvability of the halting problem. Let (Mi)i>o be an enumeration of 
Turing machines, and let ji be an enumeration of the natural numbers with the 
property that every natural number appears infinitely often in the enumeration. 
For every i, let = if Turing machine Mj., when started with input 0, 
halts in less than i steps, but not in i' steps for any i' < i such that jV = ji', and 
let tti = otherwise. Then (aj) converges to 0, since once we have recognized 
all the machines among Mi, . . . , M„ that eventually halt, remains below 1/n. 
But any any value r{l/n) meeting the specification above tells us how long we 
have to wait to determine whether M„ halts, and so any such such r would 
enable us to solve the halting problem. 

In a similar way, one can construct a computable sequence (a„)„gN of ratio- 
nal numbers that is monotone and bounded, but converges to a noncomputable 
real number. This, too, implies that no computable function r(e) meets the 
specification above. Such a sequence is known as a Speaker sequence^ and an 
example is given in the proof of Theorem 15. 1[ below. Thus neither monotonic- 
ity nor the existence of a computable limit alone is enough to guarantee the 
effective convergence of a sequence of rationals. 

What these examples show is that the question as to whether it is possible to 
compute a bound on a rate of convergence of a sequence from some initial data 
is not a question about the speed of the sequence's convergence, but, rather, its 
predictability. In this section, we show that in general, one cannot compute a 
bound on the rate of convergence of ergodic averages from the initial data, al- 
though one can do so when dealing when dealing with an ergodic transformation 
of a (finite) measure space. 

The results in this section presuppose notions of computability for various 
objects of analysis. There are a number of natural, and equivalent, frameworks 
for defining such notions. For complete detail, the reader should consult Pour 
el and Richards or Weihrauch [33] • To make sense of the results below, 
however, the following sketchy overview should suffice. 

The general strategy is to focus on infinitary objects that can be represented 
with a countable set of data. For example, a real number can be taken to be 
represented by a sequence of rational numbers together with a bound on its rate 
of convergence; the corresponding real number is said to be computable if it has 
a computable representation. In other words, a computable real number is given 
by computable functions a : N ^ Q and r : Q ^ N with the property that for 
every rational e > 0, — ar(e)\ < £ for every m > r(e). A function taking 
infinitary objects as arguments is said to be computable if the output can be 
computed by a procedure that queries any legitimate representation of the input. 
For example, a computable function f{x) from R to R is given by an algorithm 
which, given the ability to request arbitrarily good rational approximations to 
produces arbitrarily good rational approximations to y = f{x), in the sense 
above. In other words, / is given by algorithms that compute functions ay and 
Ty representing y, given the ability to query "oracles" Ux and representing x. 
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Similar considerations apply to separable Hilbert spaces, which are assumed 
to come with a fixed choice of basis. An element of the space can be represented 
by a sequence of finite linear combinations of basis elements together with a 
bound on their rate of rate of convergence in the Hilbert space norm; once again, 
such an element is said to be computable if it has a computable representation. 
The inner product and norm are then computable operations on the entire 
space. A bounded linear operator can be represented by the sequence of values 
on elements of the basis, and is computable if that sequence is. In general, 
a computable bounded linear operator need not have a computable norm (see 

mm)- 

Computability with respect to a measure space can be understood in similar 
ways. A measurable function is represented by a sequence of suitably simple 
functions that approximate it in the norm, together with a rate of conver- 
gence. Note that this means that a measurable function is represented only up 
to a.e. equivalence. One can associate to any measure preserving operator r 
the bounded linear operator Tf = / o r, and take r to be represented by any 
representative of the associated T. 

The following theorem shows that it is not always possible to compute a 
bound on the rate of convergence of a sequence of ergodic averages from the 
initial data. 

Theorem 5.1 There are a computable measure preserving transformation of 
[0, 1] under Lebesgue measure and a computable characteristic function f = XA 
such that if f* = lim„^„/, then ||/*||2 is not a computable real number. 

In particular, f* is not a computable element of the Hilbert space, and there 
is no computable bound on the rate of convergence of (Anf) in either the or 

norm. Nor is there a bound on the pointwise rate of convergence of (Anf), 
in the sense of Theorem \3.'A 

Proof. It suffices to prove the assertion in the first sentence. The rest of the 
assertions follow, since if /* were computable, then ||/*||2 would be computable, 
and if there were a computable bound on the rate of convergence of {Anf) in 
the norm, then /* would be a computable element of i^([0, 1]). Computable 
bounds on the rate of convergence in either of the other senses mentioned in 
the remainder of the theorem would imply a computable bound on the rate of 
convergence in the L"^ norm. 

To prove the assertion in the first sentence, we use a variant of constructions 
described in [2l[29|. First, suppose / is the characteristic function of the interval 
[0, 1/2), and r is the rotation tx = {x + a) mod 1, where a is either or 1/2'' 
for some j > 1. If a = 0, then f* — f and ||/*||2 — 1/2. If a = 1/2-' for any 
j > 1, then /* is the constant function equal to 1/2, and ||/*||2 = 1/4. Thus 
knowing ||/*||2 allows us to determine whether a — 0. 

Our strategy will be to divide [0, 1) into intervals [1 — 2*, 1 — 2*+^), and let r 
rotate each interval by a computable real number that depends on whether 
the ith Turing machine halts. With a suitable choice of /, the limit /* of the 
sequence (Anf) will then encode information as to which Turing machines halt 
on input 0. 
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The details are as follows. Let T{e, x, s) be Kleene's T predicate, which 
asserts that s codes a halting computation sequence of Turing machine e on 
input X. The predicate T is computable, but the set {e | 3s r(e,0, s)} is not. 
Without loss of generality, we can assume that for any e and x there is at most 
one s such that T{e,x,s) holds. We will prove the theorem by constructing 
computable r and / such that {e | 3s T{e,0, s)} is computable from ||/*||2. 

Define the computable sequence (a^) of computable reals by setting 

1/2*+^+^ for the unique j satisfying T{i,0,j), if there is one 
otherwise 

Let T be the measure preserving transformation that rotates each interval [1 — 
2',1 — 2'+^) by Qi. To see that the sequence (a^) is computable, remember 
that we only need to by able to compute approximations to the a^'s uniformly; 
we can do this by testing T{i,0,j) up to a sufficiently large value of j. To 
see that r is computable, remember that it is sufRcient to be able to compute 
approximations to the value of T applied to any simple function, given rational 
approximations to the the a^'s. 

Let / be the characteristic function of the set IJJl — 2', 1 — 3 • 2*+^), so that 
/ is equal to 1 on the left half of each interval [1 — 2*, 1 — 2*+^) and on the 
right half. Let /* = lim„A„/. Then 

II/II2 = X! I ' oi+T + X! 



and 



4 2*+i ^-^ 2 2*+i 

{t I Bj T(j,Oj)} {j I -aj T{i,0,3)} 



' 'irii2 = E^^ -11/112= ^ — 



2 uJ 112 2 2*+i ^-^ 2*+'^' 

iSN {i I 3j T(i,Oj)} 



Calling this last expression r, it suffices to show that {i \ 3j T{i,0,j)} is com- 
putable from r. But the argument is now standard (see [28l Section 0.2, Corol- 
lary 2a] or [Sni Theorem III.2.2]). For each n, let 



Then the sequence (r„) is computable and increases monotonically to r. To 
determine whether Turing machine i halts on input 0, it suffices to search for 
an n and an approximation to r sufficiently good to ensure |r — r„| < 1/2*"'"'^. 
Then we only need to check if there is a j < n such that T{i,0,j) holds; if there 
isn't, T{i, 0,j) is false for every j. □ 



The proof of Theorem 15. II relied on the fact that the system we constructed 
is not ergodic; we used the behavior of the system on each ergodic component 
to encode the behavior of a Turing machine. The next two theorems and their 
corollary show that if, on the other hand, the space in question is ergodic, then 
one always has a computable rate of convergence. 
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Theorem 5.2 Let T be a nonexpansive linear operator on a separable Hilbert 
space and let f be an element of that space. Let f* = lim„ A„ f- Then f*, and 
a bound on the rate of convergence of (A„/) in the Hilbert space norm, can be 
computed from f, T, and ||/*||. 

Proof. It suffices to sliow that one can compute a bound on the rate of conver- 
gence of {Anf) from the given data. Assuming / is not already a fixed point 
of T, write f = f* + g, and let the sequences {gi), (ui), and (a^) be defined as 
in Section O Then g = lim^ g^, and gi = Ui — Tui and — \\gi\\ for every i. 
Let a ~ lim^ a^. Then a = \\g\\ ~ \/YfW~~^f*W can be computed from / and 
|j/*||2- For any m, n > m, and i, we have 

\\Amf-Anf\\ = \\A^g-Ang\\ 

< \\Amgi - AngiW + \\A^{g - g^)\\ + \\An{g - g^)\\ 

< \\A^g,\\ + \\A^g,\\+2\\g^ g,\\ 

< \\A„,gi\\ + \\Ang^\\ + 2y/2{a ~ a,)\\f\\ 

as in the proof of Lemma 12.51 Given e, using the given data we can now find 
an i such that the last term on the right hand side is less than e/2, compute 
Ui, and then, using Lemma 12.31 2. determine an m large enough so that for any 
n>m, WA^giW + ||A„g,|| < e/2. □ 



Theorem 5.3 Let X = {X,B,[l) be a separable measure space, let t be a mea- 
sure preserving transformation of X, and let T be the associated Koopman op- 
erator. Then for any f in L^{X), bounds on the rate of convergence in the L^ 
norm, in the L^ norm, and in the sense of Theorem \3.S\ can be computed from 
f, T, and \\f*\\2. 

Proof. The previous theorem provides bounds on the rate of convergence in the 
L^ norm, and hence in the L^ norm as well. 

For convergence in the sense of Theorem 13. 2[ consider inequality ([6|) , where 
now h is /* and fi is gi + /*: 

\A^f{x) - A,J{x)\ < \A^{f - (g, + f*)){x)\ 

+ \A,n9rix)\ + \Angrix)\ + |A„(/ ~ (g, + f*)){x)\. 

The sequence (/ — {gi + /*))iGN converges to in the L^ norm, and, as in 
the proof of Theorem 15.21 we can compute a bound on the rate of convergence 
from the given data. Using CoroUarv 13.51 we can make the first and last terms 
small outside of a small set of exceptions, independent of m and n, by making i 
sufficiently large. Using Lemma [3.91 we can then determine how large n has to 
be so that the remaining terms are small outside a small set of exceptions, for 
all m > n. □ 
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Corollary 5.4 With X = (X,B,^) as above, suppose r is an ergodic measure 
preserving transformation. Then for any f in L^{X), bounds on the rate of 
convergence in the norm, in the norm, and in the sense of Theorem lS.^ 
can be computed from f, T, and /i. 

For any f in L^(X), bounds on the rate of convergence in the norm and in 
the sense of Theorem \3.^ can be computed from T, /i, and a sequence of L'^{X) 
functions approximating f in the norm (together with a rate of convergence) . 

Proof. If the system is ergodic, /* is a.e. equal to the constant / / dfi, in which 
case ||/*||2 = 1/ / d^\- Thus ||/*||2 is computable from / and ^, and we can 
apply the previous theorem. 

Suppose now f E and we are given a sequence (fi) of functions 
approaching / in the norm, together with a rate of convergence. Since 

\\Anf-Anf,\\l = P„(/-/OI|l < 11/ -/.111 

for every n, we can make ||^m/ — ^n/||i small by first picking i large enough and 
then ensuring that ||Am/i — ^n/i||i is small. Similarly, we can make \Amf{x) — 
Anf{x)\ small outside a small set of exceptions by first choosing i sufficiently 
large, applying Corollary 13.51 and then using the previous theorem to choose 
n large enough so that \Amfi{x) — Anfi{x)\ is small outside a small set of 
exceptions for every m > n. □ 

The issues raised here can be considered from a spectral standpoint as well. 
If T is a unitary transformation of a Hilbert space, then the spectral measure 
af associated to / can be described in the following way. For each k G Z, let 
bk = {T^ fi f) be the /cth autocorrelation coefficient of /. Let T be the circle 
with radius 1, identified with the interval [0, 27r). Let I be the linear operator 
on the complex Hilbert space L'^iT) defined with respect to the basis (e''^^)fegz 
by I{e^^^) — bk. The sequence b^ is a positive definite sequence, and so by 
Bochner's theorem (see [HI US]), there is a positive measure ct/ on T such that 
I{g) = J g daj. It is well known that ||/*||2 — c/ljO}), and Kachurovskii [151 
page 670] shows that if /* = 0, then for every n and S e (0,7r), 

\\A,jh<JcTf{^6,s)+ ^."fji; 

V nsm(d/2) 

This last expression shows that, in the case where /* = 0, one can compute a 
bound the rate of convergence of (A^f) from a bound on the rate of convergence 
of af{—6,6) as 6 approaches 0. The problem is that / is not necessarily a 
bounded linear transformation, and so cr/ is not generally computable from /. 
Theorem 15.21 above shows that for any / it is nonetheless possible to compute 
r from (T/({0}), /, and T. 

For any set of natural numbers X , let X' denote the halting problem relative 
to X. The proof of Theorem 15.11 shows . more generally, the following: 

Theorem 5.5 For any set of natural numbers X, there are a Lebesgue-measure 
preserving transformation r of [0,1], computable from X, and a computable 
element f o/L^([0, 1]), such that X' is computable from ||/*||2- 



27 



The results in this section can be adapted to yield information with respect 
to provability in restricted axiomatic frameworks. Constructive mathematics, 
for example, aims to use only principles that can be given a direct computa- 
tional interpretation (see, for example, [HIT])- There is also a long tradition of 
developing mathematics in classical theories that are significantly weaker than 
set theory. In the field of reverse mathematics, this is done with an eye towards 
calibrating the degree of nonconstructivity of various theorems of mathematics 
(see |30)): in the field of proof mining, this is done with an eye towards mining 
proofs for additional information (see Section [S] below). 

When a theorem of modern mathematics is not constructively valid, one can 
search for an "equal hypothesis" substitute, i.e. a constructive theorem with 
the same hypotheses, and with a conclusion that is easily seen to be classically 
equivalent to the original theorem. Bishop's upcrossing inequalities, as well as 
the results of Spitters [31] , are of this form. The results of Sections [2] and [3] are 
also of this form, and are provable both constructively and in the weak base 
theory RCAg of reverse mathematics. One can also look for "equal conclu- 
sion" substitutes, by seeking classically equivalent but constructively stronger 
hypotheses. Theorem l5.2l has this flavor, but it is hard to see how one can turn 
it into a constructive theorem, because it is not clear how one can refer to ||/*||2 
without presupposing that (A„/) converges. One can show, constructively and 
in RCAg, that if the projection of / on the subspace N described in the proof 
of Theorem 12.11 exists then (A„/) converges; but the assumption that the pro- 
jection of / on M exists is not sufficient (see [2131] and the corrigendum to the 
latter). An interesting equal conclusion constructive version of the pointwise 
ergodic theorem can be found in Nuber [27] . 

Theorem 15.11 shows that the mean and pointwise ergodic theorems do not 
have constructive proofs. In fact, in the setting of reverse mathematics, they 
are equivalent to a set-existence principle known as arithmetic comprehension 
over RCAg- For stronger results, see [2l[29]. 

6 Proof-theoretic techniques 

The methods we have used in Sections [2| and [3| belong to a branch of mathemat- 
ical logic called "proof mining," where the aim is to develop general techniques 
that allow one to extract additional information from nonconstructive or inef- 
fective mathematical proofs. The program is based on two simple observations: 
first, ordinary mathematical proofs can typically be represented in formal sys- 
tems that are much weaker then axiomatic set theory; and, second, proof theory 
provides general methods of analyzing formal proofs in such theories, with an 
eye towards locating their constructive content. Traditional research has aimed 
to show that many classical theories can be reduced to constructive theories, 
at least in principle, and has developed a variety of techniques for establishing 
such reductions. These include double-negation translations, cut-elimination, 
Herbrand's theorem, realizability, and functional interpretations. (The Hand- 
book of Proof Theory 8 provides an overview of the range of methods.) Proof 
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mining involves adapting and specializing these techniques to specific mathe- 
matical domains where additional information can fruitfully be sought. 

Our constructive versions of the mean and pointwise ergodic theorems are 
examples of Kreisel's no-counterexample interpretation [521 Effective proofs 
of such translated statements can often be obtained using variants of Godel's 
functional ( "Dialectica" ) interpretation [TU] (see also [T]). Ulrich Kohlenbach 
has shown that the Dialectica interpretation can be used as an effective tool; 
see, for example, [HJUT]. For example, our constructive mean ergodic theorem. 
Theorem 12.161 provides bounds that depend only on K and In fact, 

the usual proofs of the mean ergodic theorem can be carried out in axiomatic 
frameworks for which the general metamathematical results of Gerhardy and 
Kohlenbach [9] guarantee such uniformity. 

While the methods of the paper just cited do show how one can find an 
explicit expression for the requisite bound, the resulting expression would not 
yield, a priori, useful bounds. For that, a more refined analysis, due to Kohlen- 
bach [TH|, can be used. The nonconstructive content of the Riesz proof of the 
mean ergodic theorem can be traced to the use of the principle of convergence 
for bounded monotone sequences of real numbers. In formal symbolic terms, 
the fact that every bounded increasing sequence of real numbers converges can 
be expressed as follows: 

Va : N ^ R, c e R(Vi (a, < a,+i < c) ^ 

Ve > 3n Vm > n {\am — Onl < £))■ 

Using a principle known as "arithmetic comprehension," we can conclude that 
there is a function, r, bounding the rate of convergence: 

Va : N ^ R, c G M (Vi (a; < a,+i < c) ^ 

3r Ve > Vm > r{e) (|a„ - ar(e) I < e))- (10) 

In general, r cannot be computed from the sequence (a^). On the other hand, 
the proof of Theorem 15.21 shows that witnesses to the mean ergodic theorem 
can be computed from a bound r on the rate of convergence, for a sequence 
(oi) that is explicitly computed from T and /. Moreover, the proof of this fact 
can be carried out in a weak theory. Kohlenbach's results show that, in such 
situations, one can compute explicit witnesses to the Dialectica translation to 
the theorem in question from a weaker version of principle ()10p : 

Va : N R, c e R (Vi {a, < a,+i < c) ^ 

Ve > 0,M3n {M{n) > n i\aM{n) ~ a«| < e))- 

This last principle can be given a clear computational interpretation: given e 
and M, one can iteratively compute 0, M(0), Af (Af(0)), . . . until one finds a 
value of n such that |ajvf(n) ~ o-n\ < £• This information can then be used to 
witness the Dialectica translation of the conclusion, that is, our constructive 
mean ergodic theorem. 
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This strategy is clearly in evidence in Section [21 In practice, it is both in- 
feasible and unnecessary to express the initial proof in completely formal terms. 
Rather, one undertakes a good deal of heuristic manipulation of the original 
proof, using the translation to determine what form intermediate lemmas should 
have, and how they should be combined. The metamathematical results are 
therefore used as a guide, providing both guarantees as to what results can be 
achieved, and the strategies needed to achieve them. 
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